REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants.

Authors

  • Nilah M Ioannidis
  • Joseph H Rothstein
  • Vikas Pejaver
  • Sumit Middha
  • Shannon K McDonnell
  • Saurabh Baheti
  • Anthony Musolf
  • Qing Li
  • Emily Holzinger
  • Danielle Karyadi
  • Lisa A Cannon-Albright
  • Craig C Teerlink
  • Janet L Stanford
  • William B Isaacs
  • Jianfeng Xu
  • Kathleen A Cooney
  • Ethan M Lange
  • Johanna Schleutker
  • John D Carpten
  • Isaac J Powell
  • Olivier Cussenot
  • Geraldine Cancel-Tassin
  • Graham G Giles
  • Robert J MacInnis
  • Christiane Maier
  • Chih-Lin Hsieh
  • Fredrik Wiklund
  • William J Catalona
  • William D Foulkes
  • Diptasri Mandal
  • Rosalind A Eeles
  • Zsofia Kote-Jarai
  • Carlos D Bustamante
  • Daniel J Schaid
  • Trevor Hastie
  • Elaine A Ostrander
  • Joan E Bailey-Wilson
  • Predrag Radivojac
  • Stephen N Thibodeau
  • Alice S Whittemore
  • Weiva Sieh
Abstract

The vast majority of coding variants are rare, and assessment of the contribution of rare variants to complex traits is hampered by low statistical power and limited functional data. Improved methods for predicting the pathogenicity of rare coding variants are needed to facilitate the discovery of disease variants from exome sequencing studies. We developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP, SiPhy, phyloP, and phastCons. REVEL was trained with recently discovered pathogenic and rare neutral missense variants, excluding those previously used to train its constituent tools. When applied to two independent test sets, REVEL had the best overall performance (p < 10^-12) as compared to any individual tool and seven ensemble methods: MetaSVM, MetaLR, KGGSeq, Condel, CADD, DANN, and Eigen. Importantly, REVEL also had the best performance for distinguishing pathogenic from rare neutral variants with allele frequencies <0.5%. The area under the receiver operating characteristic curve (AUC) for REVEL was 0.046-0.182 higher in an independent test set of 935 recent SwissVar disease variants and 123,935 putatively neutral exome sequencing variants and 0.027-0.143 higher in an independent test set of 1,953 pathogenic and 2,406 benign variants recently reported in ClinVar than the AUCs for other ensemble methods. We provide pre-computed REVEL scores for all possible human missense variants to facilitate the identification of pathogenic variants in the sea of rare variants discovered as sequencing studies expand in scale.
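
To make the AUC comparison in the abstract concrete, here is a minimal sketch of how an AUC can be computed from two sets of predictor scores. It uses the rank-based definition: the AUC is the probability that a randomly chosen pathogenic variant scores higher than a randomly chosen neutral one, with ties counted as half. The function name `auc` and the toy scores are hypothetical and chosen for illustration only; they are not REVEL scores, and this is not the authors' evaluation code.

```python
def auc(scores_pathogenic, scores_neutral):
    """Rank-based AUC: the fraction of (pathogenic, neutral) pairs in which
    the pathogenic variant has the higher score; ties count as half."""
    wins = 0.0
    for p in scores_pathogenic:
        for n in scores_neutral:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pathogenic) * len(scores_neutral))


# Hypothetical toy scores (illustrative only, not taken from the paper).
pathogenic = [0.9, 0.8, 0.4]
neutral = [0.1, 0.3, 0.5, 0.2]

print(auc(pathogenic, neutral))
```

The pairwise loop is quadratic in the number of variants, which is fine for a toy example; for test sets of the size described above, a sorting-based implementation would be the more practical choice.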


Similar resources

Computational approach towards identification of pathogenic missense mutations in AMELX gene and their possible association with amelogenesis imperfecta

Amelogenin gene (AMEL-X) encodes an enamel protein called amelogenin, which plays a vital role in tooth development. Any mutations in this gene or the associated pathway lead to developmental abnormalities of the tooth. The present study aims to analyze functional missense mutations in AMEL-X genes and derive an association with amelogenesis imperfecta. The information on miss...


Calibration of multiple in silico tools for predicting pathogenicity of mismatch repair gene missense substitutions.

Classification of rare missense substitutions observed during genetic testing for patient management is a considerable problem in clinical genetics. The Bayesian integrated evaluation of unclassified variants is a solution originally developed for BRCA1/2. Here, we take a step toward an analogous system for the mismatch repair (MMR) genes (MLH1, MSH2, MSH6, and PMS2) that confer colon cancer su...


Analysis of Missense Mutations of CX3CR1 Gene in Patients with Recurrent Pregnancy Loss Using Bioinformatics Tools

Introduction: Abortion is a common complication that refers to the early termination of pregnancy with the death of the fetus before the 20th week of pregnancy. Previous studies show that many genes are involved in this disease, including the CX3CR1 gene, which is one of the inflammatory response genes in the immune system. The pathogenicity of these variants was determined in this study using ...


Functional assays for classification of BRCA2 variants of uncertain significance.

The assessment of the influence of many rare BRCA2 missense mutations on cancer risk has proved difficult. A multifactorial likelihood model that predicts the odds of cancer causality for missense variants is effective, but is limited by the availability of family data. As an alternative, we developed functional assays that measure the influence of missense mutations on the ability of BRCA2 to ...


Rapid functional analysis of computationally complex rare human IRF6 gene variants using a novel zebrafish model

Large-scale sequencing efforts have captured a rapidly growing catalogue of genetic variations. However, the accurate establishment of gene variant pathogenicity remains a central challenge in translating personal genomics information to clinical decisions. Interferon Regulatory Factor 6 (IRF6) gene variants are significant genetic contributors to orofacial clefts. Although approximately three ...



Journal title:
  • American journal of human genetics

Volume 99, Issue 4

Pages: -

Publication date: 2016